Grayson County
MedVision: Dataset and Benchmark for Quantitative Medical Image Analysis
Yao, Yongcheng, Zong, Yongshuo, Dutt, Raman, Yang, Yongxin, Tsaftaris, Sotirios A, Hospedales, Timothy
Current vision-language models (VLMs) in medicine are primarily designed for categorical question answering (e.g., "Is this normal or abnormal?") or qualitative descriptive tasks. However, clinical decision-making often relies on quantitative assessments, such as measuring the size of a tumor or the angle of a joint, from which physicians draw their own diagnostic conclusions. This quantitative reasoning capability remains underexplored and poorly supported in existing VLMs. In this work, we introduce MedVision, a large-scale dataset and benchmark specifically designed to evaluate and improve VLMs on quantitative medical image analysis. MedVision spans 22 public datasets covering diverse anatomies and modalities, with 30.8 million image-annotation pairs. We focus on three representative quantitative tasks: (1) detection of anatomical structures and abnormalities, (2) tumor/lesion (T/L) size estimation, and (3) angle/distance (A/D) measurement. Our benchmarks show that current off-the-shelf VLMs perform poorly on these tasks. However, with supervised fine-tuning on MedVision, we significantly enhance their performance across detection, T/L estimation, and A/D measurement, demonstrating reduced error rates and improved precision. This work provides a foundation for developing VLMs with robust quantitative reasoning capabilities in medical imaging. Code and data are available at https://medvision-vlm.github.io.
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- North America > United States > Texas > Grayson County (0.04)
- Europe > Switzerland (0.04)
- Information Technology > Sensing and Signal Processing > Image Processing (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)
Unsupervised Representation Learning from Sparse Transformation Analysis
Song, Yue, Keller, Thomas Anderson, Yue, Yisong, Perona, Pietro, Welling, Max
There is a vast literature on representation learning based on principles such as coding efficiency, statistical independence, causality, controllability, or symmetry. In this paper we propose to learn representations from sequence data by factorizing the transformations of the latent variables into sparse components. Input data are first encoded as distributions of latent activations and subsequently transformed using a probability flow model, before being decoded to predict a future input state. The flow model is decomposed into a number of rotational (divergence-free) vector fields and a number of potential flow (curl-free) fields. Our sparsity prior encourages only a small number of these fields to be active at any instant and infers the speed with which the probability flows along these fields. Training this model is completely unsupervised using a standard variational objective and results in a new form of disentangled representations where the input is not only represented by a combination of independent factors, but also by a combination of independent transformation primitives given by the learned flow fields. When viewing the transformations as symmetries one may interpret this as learning approximately equivariant representations. Empirically we demonstrate that this model achieves state of the art in terms of both data likelihood and unsupervised approximate equivariance errors on datasets composed of sequence transformations.
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- North America > United States > California (0.04)
- Europe > United Kingdom > North Sea > Southern North Sea (0.04)
- (6 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Vision (0.93)
- Information Technology > Artificial Intelligence > Natural Language (0.68)
Revisiting Skin Tone Fairness in Dermatological Lesion Classification
Kalb, Thorsten, Kushibar, Kaisar, Cintas, Celia, Lekadir, Karim, Diaz, Oliver, Osuala, Richard
Addressing fairness in lesion classification from dermatological images is crucial due to variations in how skin diseases manifest across skin tones. However, the absence of skin tone labels in public datasets hinders building a fair classifier. To date, such skin tone labels have been estimated prior to fairness analysis in independent studies using the Individual Typology Angle (ITA). Briefly, ITA calculates an angle based on pixels extracted from skin images taking into account the lightness and yellow-blue tints. These angles are then categorised into skin tones that are subsequently used to analyse fairness in skin cancer classification. In this work, we review and compare four ITA-based approaches of skin tone classification on the ISIC18 dataset, a common benchmark for assessing skin cancer classification fairness in the literature. Our analyses reveal a high disagreement among previously published studies demonstrating the risks of ITA-based skin tone estimation methods. Moreover, we investigate the causes of such large discrepancy among these approaches and find that the lack of diversity in the ISIC18 dataset limits its use as a testbed for fairness analysis. Finally, we recommend further research on robust ITA estimation and diverse dataset acquisition with skin tone annotation to facilitate conclusive fairness assessments of artificial intelligence tools in dermatology.
- Europe > Switzerland (0.04)
- South America (0.04)
- Oceania > Australia (0.04)
- (6 more...)
- Research Report (0.64)
- Overview (0.48)
- Health & Medicine > Therapeutic Area > Dermatology (1.00)
- Health & Medicine > Therapeutic Area > Oncology > Skin Cancer (0.57)
Federated Model Aggregation via Self-Supervised Priors for Highly Imbalanced Medical Image Classification
Elbatel, Marawan, Wang, Hualiang, Martí, Robert, Fu, Huazhu, Li, Xiaomeng
In the medical field, federated learning commonly deals with highly imbalanced datasets, including skin lesions and gastrointestinal images. Existing federated methods under highly imbalanced datasets primarily focus on optimizing a global model without incorporating the intra-class variations that can arise in medical imaging due to different populations, findings, and scanners. In this paper, we study the inter-client intra-class variations with publicly available self-supervised auxiliary networks. Specifically, we find that employing a shared auxiliary pre-trained model, like MoCo-V2, locally on every client yields consistent divergence measurements. Based on these findings, we derive a dynamic balanced model aggregation via self-supervised priors (MAS) to guide the global model optimization. Fed-MAS can be utilized with different local learning methods for effective model aggregation toward a highly robust and unbiased global model.
- Europe > Switzerland (0.05)
- Asia > China > Hong Kong (0.05)
- Asia > Singapore (0.04)
- North America > United States > Texas > Grayson County (0.04)
- Research Report (0.82)
- Instructional Material > Course Syllabus & Notes (0.51)
- Instructional Material > Online (0.41)
- Health & Medicine > Therapeutic Area (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping
Wakilpoor, Ceyer, Martin, Patrick J., Rebhuhn, Carrie, Vu, Amanda
Reinforcement learning in heterogeneous multi-agent scenarios is important for real-world applications but presents challenges beyond those seen in homogeneous settings and simple benchmarks. In this work, we present an actor-critic algorithm that allows a team of heterogeneous agents to learn decentralized control policies for covering an unknown environment. This task is of interest to national security and emergency response organizations that would like to enhance situational awareness in hazardous areas by deploying teams of unmanned aerial vehicles. To solve this multi-agent coverage path planning problem in unknown environments, we augment a multi-agent actor-critic architecture with a new state encoding structure and triplet learning loss to support heterogeneous agent learning. We developed a simulation environment that includes real-world environmental factors such as turbulence, delayed communication, and agent loss, to train teams of agents as well as probe their robustness and flexibility to such disturbances.
- North America > United States > Virginia (0.04)
- North America > United States > Texas > Grayson County (0.04)
- North America > United States > Colorado (0.04)
- Government > Military (1.00)
- Leisure & Entertainment > Games > Computer Games (0.34)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Model-free Control of Chaos with Continuous Deep Q-learning
Ikemoto, Junya, Ushio, Toshimitsu
The OGY method is one of control methods for a chaotic system. In the method, we have to calculate a stabilizing periodic orbit embedded in its chaotic attractor. Thus, we cannot use this method in the case where a precise mathematical model of the chaotic system cannot be identified. In this case, the delayed feedback control proposed by Pyragas is useful. However, even in the delayed feedback control, we need the mathematical model to determine a feedback gain that stabilizes the periodic orbit. To overcome this problem, we propose a model-free reinforcement learning algorithm to the design of a controller for the chaotic system. In recent years, model-free reinforcement learning algorithms with deep neural networks have been paid much attention to. Those algorithms make it possible to control complex systems. However, it is known that model-free reinforcement learning algorithms are not efficient because learners must explore their control policies over the entire state space. Moreover, model-free reinforcement learning algorithms with deep neural networks have the disadvantage in taking much time to learn their control optimal policies. Thus, we propose a data-based control policy consisting of two steps, where we determine a region including the stabilizing periodic orbit first, and make the controller learn an optimal control policy for its stabilization. In the proposed method, the controller efficiently explores its control policy only in the region.
- Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.05)
- North America > United States > New York (0.04)
- North America > United States > Texas > Grayson County (0.04)
A Survey of Adaptive Resonance Theory Neural Network Models for Engineering Applications
da Silva, Leonardo Enzo Brito, Elnabarawy, Islam, Wunsch, Donald C. II
This survey samples from the ever-growing family of adaptive resonance theory (ART) neural network models used to perform the three primary machine learning modalities, namely, unsupervised, supervised and reinforcement learning. It comprises a representative list from classic to modern ART models, thereby painting a general picture of the architectures developed by researchers over the past 30 years. The learning dynamics of these ART models are briefly described, and their distinctive characteristics such as code representation, long-term memory and corresponding geometric interpretation are discussed. Useful engineering properties of ART (speed, configurability, explainability, parallelization and hardware implementation) are examined along with current challenges. Finally, a compilation of online software libraries is provided. It is expected that this overview will be helpful to new and seasoned ART researchers.
- South America > Brazil > Federal District > Brasília (0.04)
- Oceania > New Zealand (0.04)
- North America > United States > Texas > Grayson County (0.04)
- (8 more...)
- Research Report (1.00)
- Overview (1.00)
- Leisure & Entertainment > Games > Computer Games (1.00)
- Government (1.00)
- Health & Medicine (0.92)
- Education > Educational Setting (0.67)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- (3 more...)
Advancing GraphSAGE with A Data-Driven Node Sampling
Oh, Jihun, Cho, Kyunghyun, Bruna, Joan
As an efficient and scalable graph neural network, GraphSAGE has enabled an inductive capability for inferring unseen nodes or graphs by aggregating subsampled local neighborhoods and by learning in a mini-batch gradient descent fashion. The neighborhood sampling used in GraphSAGE is effective in order to improve computing and memory efficiency when inferring a batch of target nodes with diverse degrees in parallel. Despite this advantage, the default uniform sampling suffers from high variance in training and inference, leading to sub-optimum accuracy. We propose a new data-driven sampling approach to reason about the real-valued importance of a neighborhood by a non-linear regressor, and to use the value as a criterion for subsampling neighborhoods. The regressor is learned using a value-based reinforcement learning. The implied importance for each combination of vertex and neighborhood is inductively extracted from the negative classification loss output of GraphSAGE. As a result, in an inductive node classification benchmark using three datasets, our method enhanced the baseline using the uniform sampling, outperforming recent variants of a graph neural network in accuracy.
- Asia > South Korea > Seoul > Seoul (0.05)
- Africa > Zambia > Southern Province > Choma (0.05)
- North America > United States > Texas > Grayson County (0.04)
- North America > United States > New York > New York County > New York City (0.04)
Views of AI, robots, and automation based on internet search data
Artificial intelligence, robots, and automation are rising in importance in many areas. As noted in the recent book, "The Future of Work: Robots, AI, and Automation," there are exciting advances in finance, transportation, national defense, smart cities, and health care, among other areas. Businesses are developing solutions that improve the efficiency and effectiveness of their operations and using these tools to improve the way their firms function. Yet there also are concerns about the impact of these developments on jobs and personal privacy. A Pew Research Center national survey revealed considerable unease about emerging trends.
- North America > United States > California > San Francisco County > San Francisco (0.15)
- Asia > China (0.06)
- North America > United States > Virginia > Albemarle County > Charlottesville (0.05)
- (16 more...)
- Transportation > Ground > Road (1.00)
- Information Technology (1.00)
- Automobiles & Trucks (1.00)
- Government > Regional Government (0.70)
Apple Boosts Face ID Tech Maker Finisar
Finisar recently started shipping production quantities of its vertical-cavity surface-emitting lasers, or VCSELs. The device is critical to Apple's Face ID recognition system, providing the 3-D sensing capabilities that detect a face. It is part of a feature Apple calls the "Dot Projector" that uses a laser to beam 30,000 infrared dots across the face. The company ran into complications assembling that part earlier this year, creating a production snag for the iPhone X. Apple is expected to roll the Face ID technology across more devices such as the iPad in the months and years ahead, said Gene Munster, managing partner at Loup Ventures, a venture-capital firm specializing in tech research.
- North America > United States > Texas > Grayson County > Sherman (0.08)
- Asia > Taiwan (0.08)
- Banking & Finance (0.41)
- Information Technology (0.40)
- Information Technology > Artificial Intelligence > Vision > Face Recognition (0.88)
- Information Technology > Communications > Mobile (0.61)